Overview
Brought to you by YData
Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 974 |
| Missing cells | 2504 |
| Missing cells (%) | 12.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 152.3 KiB |
| Average record size in memory | 160.1 B |
Variable types
| Text | 6 |
|---|---|
| DateTime | 2 |
| Categorical | 9 |
| Numeric | 3 |
STATE has constant value "Massachusetts" | Constant |
CITY is highly overall correlated with COUNTY and 3 other fields | High correlation |
COUNTY is highly overall correlated with CITY and 1 other fields | High correlation |
GENDER is highly overall correlated with PREFIX | High correlation |
LAT is highly overall correlated with CITY | High correlation |
LON is highly overall correlated with CITY | High correlation |
MARITAL is highly overall correlated with PREFIX | High correlation |
PREFIX is highly overall correlated with GENDER and 1 other fields | High correlation |
ZIP is highly overall correlated with CITY and 1 other fields | High correlation |
DEATHDATE has 820 (84.2%) missing values | Missing |
SUFFIX has 953 (97.8%) missing values | Missing |
MAIDEN has 588 (60.4%) missing values | Missing |
ZIP has 142 (14.6%) missing values | Missing |
Id has unique values | Unique |
ADDRESS has unique values | Unique |
LAT has unique values | Unique |
LON has unique values | Unique |
Reproduction
| Analysis started | 2025-11-16 17:41:31.441512 |
|---|---|
| Analysis finished | 2025-11-16 17:41:34.532734 |
| Duration | 3.09 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
Id
Text
Unique
| Distinct | 974 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 974 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5605b66b-e92d-c16c-1b83-b8bf7040d51f |
|---|---|
| 2nd row | 6e5ae27c-8038-7988-e2c0-25a103f01bfa |
| 3rd row | 8123d076-0886-9007-e956-d5864aa121a7 |
| 4th row | 770518e4-6133-648e-60c9-071eb2f0e2ce |
| 5th row | f96addf5-81b9-0aab-7855-d208d3d352c5 |
| Value | Count | Frequency (%) |
| f2203fd5-1a2c-60cd-cae0-1416658880b6 | 1 | 0.1% |
| 204f8028-72f8-d6f8-761f-79ebf9f02311 | 1 | 0.1% |
| 5605b66b-e92d-c16c-1b83-b8bf7040d51f | 1 | 0.1% |
| 6e5ae27c-8038-7988-e2c0-25a103f01bfa | 1 | 0.1% |
| 8123d076-0886-9007-e956-d5864aa121a7 | 1 | 0.1% |
| 770518e4-6133-648e-60c9-071eb2f0e2ce | 1 | 0.1% |
| f96addf5-81b9-0aab-7855-d208d3d352c5 | 1 | 0.1% |
| 8e9650d1-788a-78f9-4a28-d08f7f95354a | 1 | 0.1% |
| 183df435-4190-060e-8f8e-bf63c572b266 | 1 | 0.1% |
| 720560d4-51da-c38c-ee90-c15935278df1 | 1 | 0.1% |
| Other values (964) | 964 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 3896 | 11.1% |
| 8 | 2059 | 5.9% |
| 0 | 2034 | 5.8% |
| 1 | 2007 | 5.7% |
| f | 1974 | 5.6% |
| 9 | 1971 | 5.6% |
| 3 | 1964 | 5.6% |
| d | 1939 | 5.5% |
| 5 | 1928 | 5.5% |
| b | 1928 | 5.5% |
| Other values (7) | 13364 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35064 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 3896 | 11.1% |
| 8 | 2059 | 5.9% |
| 0 | 2034 | 5.8% |
| 1 | 2007 | 5.7% |
| f | 1974 | 5.6% |
| 9 | 1971 | 5.6% |
| 3 | 1964 | 5.6% |
| d | 1939 | 5.5% |
| 5 | 1928 | 5.5% |
| b | 1928 | 5.5% |
| Other values (7) | 13364 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35064 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 3896 | 11.1% |
| 8 | 2059 | 5.9% |
| 0 | 2034 | 5.8% |
| 1 | 2007 | 5.7% |
| f | 1974 | 5.6% |
| 9 | 1971 | 5.6% |
| 3 | 1964 | 5.6% |
| d | 1939 | 5.5% |
| 5 | 1928 | 5.5% |
| b | 1928 | 5.5% |
| Other values (7) | 13364 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35064 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 3896 | 11.1% |
| 8 | 2059 | 5.9% |
| 0 | 2034 | 5.8% |
| 1 | 2007 | 5.7% |
| f | 1974 | 5.6% |
| 9 | 1971 | 5.6% |
| 3 | 1964 | 5.6% |
| d | 1939 | 5.5% |
| 5 | 1928 | 5.5% |
| b | 1928 | 5.5% |
| Other values (7) | 13364 |
BIRTHDATE
Date
| Distinct | 880 |
|---|---|
| Distinct (%) | 90.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| Minimum | 1922-03-24 00:00:00 |
|---|---|
| Maximum | 1991-11-27 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
DEATHDATE
Date
Missing
| Distinct | 148 |
|---|---|
| Distinct (%) | 96.1% |
| Missing | 820 |
| Missing (%) | 84.2% |
| Memory size | 7.7 KiB |
| Minimum | 2011-02-03 00:00:00 |
|---|---|
| Maximum | 2022-01-27 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
PREFIX
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| Mr. | |
|---|---|
| Mrs. | |
| Ms. |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.3963039 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mrs. |
|---|---|
| 2nd row | Mr. |
| 3rd row | Mr. |
| 4th row | Mr. |
| 5th row | Mr. |
Common Values
| Value | Count | Frequency (%) |
| Mr. | 494 | |
| Mrs. | 386 | |
| Ms. | 94 | 9.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mr | 494 | |
| mrs | 386 | |
| ms | 94 | 9.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 974 | |
| . | 974 | |
| r | 880 | |
| s | 480 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3308 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 974 | |
| . | 974 | |
| r | 880 | |
| s | 480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3308 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 974 | |
| . | 974 | |
| r | 880 | |
| s | 480 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3308 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 974 | |
| . | 974 | |
| r | 880 | |
| s | 480 |
FIRST
Text
| Distinct | 842 |
|---|---|
| Distinct (%) | 86.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 8.8737166 |
| Min length | 5 |
Unique
| Unique | 721 ? |
|---|---|
| Unique (%) | 74.0% |
Sample
| 1st row | Nikita578 |
|---|---|
| 2nd row | Zane918 |
| 3rd row | Quinn173 |
| 4th row | Abel832 |
| 5th row | Edwin773 |
| Value | Count | Frequency (%) |
| josé | 7 | 0.7% |
| gregorio366 | 3 | 0.3% |
| chris95 | 3 | 0.3% |
| lazaro919 | 3 | 0.3% |
| yolanda648 | 3 | 0.3% |
| domenic627 | 3 | 0.3% |
| emilio366 | 3 | 0.3% |
| bernardo699 | 3 | 0.3% |
| armando772 | 3 | 0.3% |
| travis723 | 3 | 0.3% |
| Other values (836) | 951 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 690 | 8.0% |
| e | 606 | 7.0% |
| n | 494 | 5.7% |
| i | 452 | 5.2% |
| r | 450 | 5.2% |
| o | 360 | 4.2% |
| l | 342 | 4.0% |
| 4 | 328 | 3.8% |
| 1 | 322 | 3.7% |
| 6 | 310 | 3.6% |
| Other values (57) | 4289 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 690 | 8.0% |
| e | 606 | 7.0% |
| n | 494 | 5.7% |
| i | 452 | 5.2% |
| r | 450 | 5.2% |
| o | 360 | 4.2% |
| l | 342 | 4.0% |
| 4 | 328 | 3.8% |
| 1 | 322 | 3.7% |
| 6 | 310 | 3.6% |
| Other values (57) | 4289 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 690 | 8.0% |
| e | 606 | 7.0% |
| n | 494 | 5.7% |
| i | 452 | 5.2% |
| r | 450 | 5.2% |
| o | 360 | 4.2% |
| l | 342 | 4.0% |
| 4 | 328 | 3.8% |
| 1 | 322 | 3.7% |
| 6 | 310 | 3.6% |
| Other values (57) | 4289 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8643 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 690 | 8.0% |
| e | 606 | 7.0% |
| n | 494 | 5.7% |
| i | 452 | 5.2% |
| r | 450 | 5.2% |
| o | 360 | 4.2% |
| l | 342 | 4.0% |
| 4 | 328 | 3.8% |
| 1 | 322 | 3.7% |
| 6 | 310 | 3.6% |
| Other values (57) | 4289 |
LAST
Text
| Distinct | 498 |
|---|---|
| Distinct (%) | 51.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 9.4681725 |
| Min length | 6 |
Unique
| Unique | 219 ? |
|---|---|
| Unique (%) | 22.5% |
Sample
| 1st row | Erdman779 |
|---|---|
| 2nd row | Hodkiewicz467 |
| 3rd row | Marquardt819 |
| 4th row | Smitham825 |
| 5th row | Labadie908 |
| Value | Count | Frequency (%) |
| heaney114 | 6 | 0.6% |
| rempel203 | 5 | 0.5% |
| walter473 | 5 | 0.5% |
| schowalter414 | 5 | 0.5% |
| mcclure239 | 5 | 0.5% |
| trantow673 | 5 | 0.5% |
| weissnat378 | 5 | 0.5% |
| wolf938 | 5 | 0.5% |
| hermann103 | 5 | 0.5% |
| gleason633 | 5 | 0.5% |
| Other values (489) | 924 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 683 | 7.4% |
| r | 539 | 5.8% |
| a | 504 | 5.5% |
| n | 438 | 4.7% |
| i | 390 | 4.2% |
| o | 370 | 4.0% |
| l | 345 | 3.7% |
| 9 | 322 | 3.5% |
| 7 | 318 | 3.4% |
| 1 | 316 | 3.4% |
| Other values (58) | 4997 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9222 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 683 | 7.4% |
| r | 539 | 5.8% |
| a | 504 | 5.5% |
| n | 438 | 4.7% |
| i | 390 | 4.2% |
| o | 370 | 4.0% |
| l | 345 | 3.7% |
| 9 | 322 | 3.5% |
| 7 | 318 | 3.4% |
| 1 | 316 | 3.4% |
| Other values (58) | 4997 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9222 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 683 | 7.4% |
| r | 539 | 5.8% |
| a | 504 | 5.5% |
| n | 438 | 4.7% |
| i | 390 | 4.2% |
| o | 370 | 4.0% |
| l | 345 | 3.7% |
| 9 | 322 | 3.5% |
| 7 | 318 | 3.4% |
| 1 | 316 | 3.4% |
| Other values (58) | 4997 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9222 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 683 | 7.4% |
| r | 539 | 5.8% |
| a | 504 | 5.5% |
| n | 438 | 4.7% |
| i | 390 | 4.2% |
| o | 370 | 4.0% |
| l | 345 | 3.7% |
| 9 | 322 | 3.5% |
| 7 | 318 | 3.4% |
| 1 | 316 | 3.4% |
| Other values (58) | 4997 |
SUFFIX
Categorical
Missing
| Distinct | 3 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 953 |
| Missing (%) | 97.8% |
| Memory size | 7.7 KiB |
| PhD | |
|---|---|
| JD | |
| MD |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.4761905 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PhD |
|---|---|
| 2nd row | JD |
| 3rd row | PhD |
| 4th row | JD |
| 5th row | JD |
Common Values
| Value | Count | Frequency (%) |
| PhD | 10 | 1.0% |
| JD | 8 | 0.8% |
| MD | 3 | 0.3% |
| (Missing) | 953 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| phd | 10 | |
| jd | 8 | |
| md | 3 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 21 | |
| P | 10 | |
| h | 10 | |
| J | 8 | 15.4% |
| M | 3 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 52 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 21 | |
| P | 10 | |
| h | 10 | |
| J | 8 | 15.4% |
| M | 3 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 52 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 21 | |
| P | 10 | |
| h | 10 | |
| J | 8 | 15.4% |
| M | 3 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 52 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 21 | |
| P | 10 | |
| h | 10 | |
| J | 8 | 15.4% |
| M | 3 | 5.8% |
MAIDEN
Text
Missing
| Distinct | 279 |
|---|---|
| Distinct (%) | 72.3% |
| Missing | 588 |
| Missing (%) | 60.4% |
| Memory size | 7.7 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 9.4766839 |
| Min length | 6 |
Unique
| Unique | 198 ? |
|---|---|
| Unique (%) | 51.3% |
Sample
| 1st row | Leannon79 |
|---|---|
| 2nd row | Wiegand701 |
| 3rd row | Predovic534 |
| 4th row | Walker122 |
| 5th row | Upton904 |
| Value | Count | Frequency (%) |
| jerde200 | 5 | 1.3% |
| kshlerin58 | 4 | 1.0% |
| mayert710 | 3 | 0.8% |
| yundt842 | 3 | 0.8% |
| durgan499 | 3 | 0.8% |
| lehner980 | 3 | 0.8% |
| muller251 | 3 | 0.8% |
| simonis280 | 3 | 0.8% |
| hegmann834 | 3 | 0.8% |
| volkman526 | 3 | 0.8% |
| Other values (269) | 353 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 284 | 7.8% |
| r | 201 | 5.5% |
| n | 193 | 5.3% |
| a | 184 | 5.0% |
| o | 157 | 4.3% |
| i | 144 | 3.9% |
| l | 142 | 3.9% |
| 9 | 135 | 3.7% |
| 1 | 129 | 3.5% |
| s | 124 | 3.4% |
| Other values (58) | 1965 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3658 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 284 | 7.8% |
| r | 201 | 5.5% |
| n | 193 | 5.3% |
| a | 184 | 5.0% |
| o | 157 | 4.3% |
| i | 144 | 3.9% |
| l | 142 | 3.9% |
| 9 | 135 | 3.7% |
| 1 | 129 | 3.5% |
| s | 124 | 3.4% |
| Other values (58) | 1965 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3658 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 284 | 7.8% |
| r | 201 | 5.5% |
| n | 193 | 5.3% |
| a | 184 | 5.0% |
| o | 157 | 4.3% |
| i | 144 | 3.9% |
| l | 142 | 3.9% |
| 9 | 135 | 3.7% |
| 1 | 129 | 3.5% |
| s | 124 | 3.4% |
| Other values (58) | 1965 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3658 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 284 | 7.8% |
| r | 201 | 5.5% |
| n | 193 | 5.3% |
| a | 184 | 5.0% |
| o | 157 | 4.3% |
| i | 144 | 3.9% |
| l | 142 | 3.9% |
| 9 | 135 | 3.7% |
| 1 | 129 | 3.5% |
| s | 124 | 3.4% |
| Other values (58) | 1965 |
MARITAL
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 7.7 KiB |
| M | |
|---|---|
| S |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 784 | |
| S | 189 | 19.4% |
| (Missing) | 1 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 784 | |
| s | 189 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 784 | |
| S | 189 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 973 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 784 | |
| S | 189 | 19.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 973 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 784 | |
| S | 189 | 19.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 973 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 784 | |
| S | 189 | 19.4% |
RACE
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| white | |
|---|---|
| black | |
| asian | |
| other | 16 |
| hawaiian | 13 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 5.0513347 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | white |
|---|---|
| 2nd row | white |
| 3rd row | white |
| 4th row | white |
| 5th row | white |
Common Values
| Value | Count | Frequency (%) |
| white | 680 | |
| black | 163 | 16.7% |
| asian | 91 | 9.3% |
| other | 16 | 1.6% |
| hawaiian | 13 | 1.3% |
| native | 11 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 680 | |
| black | 163 | 16.7% |
| asian | 91 | 9.3% |
| other | 16 | 1.6% |
| hawaiian | 13 | 1.3% |
| native | 11 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 808 | |
| h | 709 | |
| t | 707 | |
| e | 707 | |
| w | 693 | |
| a | 395 | |
| b | 163 | 3.3% |
| l | 163 | 3.3% |
| c | 163 | 3.3% |
| k | 163 | 3.3% |
| Other values (5) | 249 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4920 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 808 | |
| h | 709 | |
| t | 707 | |
| e | 707 | |
| w | 693 | |
| a | 395 | |
| b | 163 | 3.3% |
| l | 163 | 3.3% |
| c | 163 | 3.3% |
| k | 163 | 3.3% |
| Other values (5) | 249 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4920 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 808 | |
| h | 709 | |
| t | 707 | |
| e | 707 | |
| w | 693 | |
| a | 395 | |
| b | 163 | 3.3% |
| l | 163 | 3.3% |
| c | 163 | 3.3% |
| k | 163 | 3.3% |
| Other values (5) | 249 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4920 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 808 | |
| h | 709 | |
| t | 707 | |
| e | 707 | |
| w | 693 | |
| a | 395 | |
| b | 163 | 3.3% |
| l | 163 | 3.3% |
| c | 163 | 3.3% |
| k | 163 | 3.3% |
| Other values (5) | 249 | 5.1% |
ETHNICITY
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| nonhispanic | |
|---|---|
| hispanic |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.411704 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | nonhispanic |
|---|---|
| 2nd row | nonhispanic |
| 3rd row | nonhispanic |
| 4th row | hispanic |
| 5th row | hispanic |
Common Values
| Value | Count | Frequency (%) |
| nonhispanic | 783 | |
| hispanic | 191 | 19.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nonhispanic | 783 | |
| hispanic | 191 | 19.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2540 | |
| i | 1948 | |
| h | 974 | 9.6% |
| s | 974 | 9.6% |
| a | 974 | 9.6% |
| p | 974 | 9.6% |
| c | 974 | 9.6% |
| o | 783 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10141 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 2540 | |
| i | 1948 | |
| h | 974 | 9.6% |
| s | 974 | 9.6% |
| a | 974 | 9.6% |
| p | 974 | 9.6% |
| c | 974 | 9.6% |
| o | 783 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10141 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 2540 | |
| i | 1948 | |
| h | 974 | 9.6% |
| s | 974 | 9.6% |
| a | 974 | 9.6% |
| p | 974 | 9.6% |
| c | 974 | 9.6% |
| o | 783 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10141 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 2540 | |
| i | 1948 | |
| h | 974 | 9.6% |
| s | 974 | 9.6% |
| a | 974 | 9.6% |
| p | 974 | 9.6% |
| c | 974 | 9.6% |
| o | 783 | 7.7% |
GENDER
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| M | |
|---|---|
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 494 | |
| F | 480 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 494 | |
| f | 480 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 494 | |
| F | 480 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 974 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 494 | |
| F | 480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 974 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 494 | |
| F | 480 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 974 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 494 | |
| F | 480 |
BIRTHPLACE
Text
| Distinct | 297 |
|---|---|
| Distinct (%) | 30.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
Length
| Max length | 41 |
|---|---|
| Median length | 37 |
| Mean length | 27.022587 |
| Min length | 15 |
Unique
| Unique | 106 ? |
|---|---|
| Unique (%) | 10.9% |
Sample
| 1st row | Wakefield Massachusetts US |
|---|---|
| 2nd row | Brookline Massachusetts US |
| 3rd row | Gardner Massachusetts US |
| 4th row | Randolph Massachusetts US |
| 5th row | Stow Massachusetts US |
| Value | Count | Frequency (%) |
| massachusetts | 845 | |
| us | 845 | |
| boston | 79 | 2.5% |
| worcester | 20 | 0.6% |
| springfield | 20 | 0.6% |
| bogota | 18 | 0.6% |
| somerville | 17 | 0.5% |
| dm | 17 | 0.5% |
| saint | 17 | 0.5% |
| north | 17 | 0.5% |
| Other values (359) | 1223 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4092 | ||
| s | 3762 | |
| a | 2379 | 9.0% |
| t | 2287 | 8.7% |
| e | 1685 | 6.4% |
| h | 1138 | 4.3% |
| u | 1108 | 4.2% |
| c | 1023 | 3.9% |
| S | 1000 | 3.8% |
| M | 970 | 3.7% |
| Other values (55) | 6876 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 26320 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4092 | ||
| s | 3762 | |
| a | 2379 | 9.0% |
| t | 2287 | 8.7% |
| e | 1685 | 6.4% |
| h | 1138 | 4.3% |
| u | 1108 | 4.2% |
| c | 1023 | 3.9% |
| S | 1000 | 3.8% |
| M | 970 | 3.7% |
| Other values (55) | 6876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 26320 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4092 | ||
| s | 3762 | |
| a | 2379 | 9.0% |
| t | 2287 | 8.7% |
| e | 1685 | 6.4% |
| h | 1138 | 4.3% |
| u | 1108 | 4.2% |
| c | 1023 | 3.9% |
| S | 1000 | 3.8% |
| M | 970 | 3.7% |
| Other values (55) | 6876 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 26320 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4092 | ||
| s | 3762 | |
| a | 2379 | 9.0% |
| t | 2287 | 8.7% |
| e | 1685 | 6.4% |
| h | 1138 | 4.3% |
| u | 1108 | 4.2% |
| c | 1023 | 3.9% |
| S | 1000 | 3.8% |
| M | 970 | 3.7% |
| Other values (55) | 6876 |
ADDRESS
Text
Unique
| Distinct | 974 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
Length
| Max length | 34 |
|---|---|
| Median length | 27 |
| Mean length | 21.034908 |
| Min length | 12 |
Unique
| Unique | 974 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 510 Little Station Unit 69 |
|---|---|
| 2nd row | 747 Conn Throughway |
| 3rd row | 816 Okuneva Extension Apt 91 |
| 4th row | 127 Cole Way Unit 95 |
| 5th row | 976 Ziemann Gateway |
| Value | Count | Frequency (%) |
| unit | 159 | 4.1% |
| apt | 157 | 4.1% |
| suite | 153 | 4.0% |
| road | 19 | 0.5% |
| parade | 19 | 0.5% |
| path | 19 | 0.5% |
| boulevard | 18 | 0.5% |
| lane | 17 | 0.4% |
| trafficway | 16 | 0.4% |
| gate | 15 | 0.4% |
| Other values (1243) | 3279 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2897 | 14.1% | |
| e | 1389 | 6.8% |
| a | 1107 | 5.4% |
| t | 987 | 4.8% |
| i | 953 | 4.7% |
| r | 941 | 4.6% |
| n | 923 | 4.5% |
| o | 723 | 3.5% |
| l | 622 | 3.0% |
| u | 466 | 2.3% |
| Other values (52) | 9480 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 20488 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2897 | 14.1% | |
| e | 1389 | 6.8% |
| a | 1107 | 5.4% |
| t | 987 | 4.8% |
| i | 953 | 4.7% |
| r | 941 | 4.6% |
| n | 923 | 4.5% |
| o | 723 | 3.5% |
| l | 622 | 3.0% |
| u | 466 | 2.3% |
| Other values (52) | 9480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 20488 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2897 | 14.1% | |
| e | 1389 | 6.8% |
| a | 1107 | 5.4% |
| t | 987 | 4.8% |
| i | 953 | 4.7% |
| r | 941 | 4.6% |
| n | 923 | 4.5% |
| o | 723 | 3.5% |
| l | 622 | 3.0% |
| u | 466 | 2.3% |
| Other values (52) | 9480 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 20488 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2897 | 14.1% | |
| e | 1389 | 6.8% |
| a | 1107 | 5.4% |
| t | 987 | 4.8% |
| i | 953 | 4.7% |
| r | 941 | 4.6% |
| n | 923 | 4.5% |
| o | 723 | 3.5% |
| l | 622 | 3.0% |
| u | 466 | 2.3% |
| Other values (52) | 9480 |
CITY
Categorical
High correlation
| Distinct | 29 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| Boston | |
|---|---|
| Quincy | |
| Cambridge | 45 |
| Revere | 42 |
| Chelsea | 39 |
| Other values (24) |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.5954825 |
| Min length | 4 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Quincy |
|---|---|
| 2nd row | Boston |
| 3rd row | Quincy |
| 4th row | Boston |
| 5th row | Boston |
Common Values
| Value | Count | Frequency (%) |
| Boston | 541 | |
| Quincy | 80 | 8.2% |
| Cambridge | 45 | 4.6% |
| Revere | 42 | 4.3% |
| Chelsea | 39 | 4.0% |
| Weymouth | 37 | 3.8% |
| Somerville | 25 | 2.6% |
| Hingham | 22 | 2.3% |
| Winthrop | 22 | 2.3% |
| Brookline | 17 | 1.7% |
| Other values (19) | 104 | 10.7% |
Length
| Value | Count | Frequency (%) |
| boston | 541 | |
| quincy | 80 | 8.2% |
| cambridge | 45 | 4.6% |
| revere | 42 | 4.3% |
| chelsea | 39 | 4.0% |
| weymouth | 37 | 3.8% |
| somerville | 25 | 2.6% |
| hingham | 22 | 2.3% |
| winthrop | 22 | 2.3% |
| brookline | 17 | 1.7% |
| Other values (19) | 107 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1242 | |
| n | 719 | |
| t | 694 | |
| s | 602 | |
| B | 569 | |
| e | 468 | 7.3% |
| i | 237 | 3.7% |
| r | 207 | 3.2% |
| a | 155 | 2.4% |
| l | 151 | 2.4% |
| Other values (24) | 1380 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6424 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1242 | |
| n | 719 | |
| t | 694 | |
| s | 602 | |
| B | 569 | |
| e | 468 | 7.3% |
| i | 237 | 3.7% |
| r | 207 | 3.2% |
| a | 155 | 2.4% |
| l | 151 | 2.4% |
| Other values (24) | 1380 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6424 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1242 | |
| n | 719 | |
| t | 694 | |
| s | 602 | |
| B | 569 | |
| e | 468 | 7.3% |
| i | 237 | 3.7% |
| r | 207 | 3.2% |
| a | 155 | 2.4% |
| l | 151 | 2.4% |
| Other values (24) | 1380 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6424 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1242 | |
| n | 719 | |
| t | 694 | |
| s | 602 | |
| B | 569 | |
| e | 468 | 7.3% |
| i | 237 | 3.7% |
| r | 207 | 3.2% |
| a | 155 | 2.4% |
| l | 151 | 2.4% |
| Other values (24) | 1380 |
STATE
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| Massachusetts |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Massachusetts |
|---|---|
| 2nd row | Massachusetts |
| 3rd row | Massachusetts |
| 4th row | Massachusetts |
| 5th row | Massachusetts |
Common Values
| Value | Count | Frequency (%) |
| Massachusetts | 974 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| massachusetts | 974 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 3896 | |
| a | 1948 | |
| t | 1948 | |
| M | 974 | 7.7% |
| c | 974 | 7.7% |
| h | 974 | 7.7% |
| u | 974 | 7.7% |
| e | 974 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12662 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 3896 | |
| a | 1948 | |
| t | 1948 | |
| M | 974 | 7.7% |
| c | 974 | 7.7% |
| h | 974 | 7.7% |
| u | 974 | 7.7% |
| e | 974 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12662 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 3896 | |
| a | 1948 | |
| t | 1948 | |
| M | 974 | 7.7% |
| c | 974 | 7.7% |
| h | 974 | 7.7% |
| u | 974 | 7.7% |
| e | 974 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12662 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 3896 | |
| a | 1948 | |
| t | 1948 | |
| M | 974 | 7.7% |
| c | 974 | 7.7% |
| h | 974 | 7.7% |
| u | 974 | 7.7% |
| e | 974 | 7.7% |
COUNTY
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.7 KiB |
| Suffolk County | |
|---|---|
| Norfolk County | |
| Middlesex County | |
| Plymouth County | 49 |
| Essex County | 1 |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 14.304928 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Norfolk County |
|---|---|
| 2nd row | Suffolk County |
| 3rd row | Norfolk County |
| 4th row | Suffolk County |
| 5th row | Suffolk County |
Common Values
| Value | Count | Frequency (%) |
| Suffolk County | 644 | |
| Norfolk County | 155 | 15.9% |
| Middlesex County | 125 | 12.8% |
| Plymouth County | 49 | 5.0% |
| Essex County | 1 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| county | 974 | |
| suffolk | 644 | |
| norfolk | 155 | 8.0% |
| middlesex | 125 | 6.4% |
| plymouth | 49 | 2.5% |
| essex | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1977 | |
| u | 1667 | |
| f | 1443 | |
| t | 1023 | |
| y | 1023 | |
| 974 | ||
| n | 974 | |
| C | 974 | |
| l | 973 | |
| k | 799 | 5.7% |
| Other values (13) | 2106 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13933 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1977 | |
| u | 1667 | |
| f | 1443 | |
| t | 1023 | |
| y | 1023 | |
| 974 | ||
| n | 974 | |
| C | 974 | |
| l | 973 | |
| k | 799 | 5.7% |
| Other values (13) | 2106 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13933 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1977 | |
| u | 1667 | |
| f | 1443 | |
| t | 1023 | |
| y | 1023 | |
| 974 | ||
| n | 974 | |
| C | 974 | |
| l | 973 | |
| k | 799 | 5.7% |
| Other values (13) | 2106 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13933 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1977 | |
| u | 1667 | |
| f | 1443 | |
| t | 1023 | |
| y | 1023 | |
| 974 | ||
| n | 974 | |
| C | 974 | |
| l | 973 | |
| k | 799 | 5.7% |
| Other values (13) | 2106 |
ZIP
Real number (ℝ)
High correlation Missing
| Distinct | 70 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 142 |
| Missing (%) | 14.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2152.4219 |
| Minimum | 1801 |
|---|---|
| Maximum | 2472 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.7 KiB |
Quantile statistics
| Minimum | 1801 |
|---|---|
| 5-th percentile | 2108 |
| Q1 | 2121 |
| median | 2135 |
| Q3 | 2163 |
| 95-th percentile | 2215 |
| Maximum | 2472 |
| Range | 671 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 75.462146 |
|---|---|
| Coefficient of variation (CV) | 0.03505918 |
| Kurtosis | 10.687414 |
| Mean | 2152.4219 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 2.657886 |
| Sum | 1790815 |
| Variance | 5694.5354 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2151 | 41 | 4.2% |
| 2124 | 27 | 2.8% |
| 2152 | 27 | 2.8% |
| 2125 | 26 | 2.7% |
| 2128 | 23 | 2.4% |
| 2149 | 19 | 2.0% |
| 2121 | 19 | 2.0% |
| 2111 | 19 | 2.0% |
| 2114 | 19 | 2.0% |
| 2132 | 18 | 1.8% |
| Other values (60) | 594 | |
| (Missing) | 142 | 14.6% |
| Value | Count | Frequency (%) |
| 1801 | 1 | 0.1% |
| 1867 | 1 | 0.1% |
| 1890 | 1 | 0.1% |
| 2043 | 15 | |
| 2045 | 10 | |
| 2060 | 3 | 0.3% |
| 2066 | 5 | 0.5% |
| 2108 | 15 | |
| 2109 | 12 | |
| 2110 | 12 |
| Value | Count | Frequency (%) |
| 2472 | 7 | 0.7% |
| 2468 | 1 | 0.1% |
| 2467 | 18 | |
| 2466 | 1 | 0.1% |
| 2460 | 2 | 0.2% |
| 2459 | 2 | 0.2% |
| 2453 | 1 | 0.1% |
| 2446 | 1 | 0.1% |
| 2445 | 2 | 0.2% |
| 2215 | 15 |
LAT
Real number (ℝ)
High correlation Unique
| Distinct | 974 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.337359 |
| Minimum | 42.204921 |
|---|---|
| Maximum | 42.495464 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.7 KiB |
Quantile statistics
| Minimum | 42.204921 |
|---|---|
| 5-th percentile | 42.242428 |
| Q1 | 42.313366 |
| median | 42.343794 |
| Q3 | 42.371594 |
| 95-th percentile | 42.399264 |
| Maximum | 42.495464 |
| Range | 0.29054326 |
| Interquartile range (IQR) | 0.058228083 |
Descriptive statistics
| Standard deviation | 0.047593898 |
|---|---|
| Coefficient of variation (CV) | 0.0011241584 |
| Kurtosis | 0.51206075 |
| Mean | 42.337359 |
| Median Absolute Deviation (MAD) | 0.029053783 |
| Skewness | -0.51212007 |
| Sum | 41236.587 |
| Variance | 0.0022651792 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42.35733297 | 1 | 0.1% |
| 42.29093738 | 1 | 0.1% |
| 42.3088312 | 1 | 0.1% |
| 42.26517685 | 1 | 0.1% |
| 42.33430374 | 1 | 0.1% |
| 42.3467714 | 1 | 0.1% |
| 42.37102647 | 1 | 0.1% |
| 42.35892761 | 1 | 0.1% |
| 42.29790394 | 1 | 0.1% |
| 42.38408414 | 1 | 0.1% |
| Other values (964) | 964 |
| Value | Count | Frequency (%) |
| 42.20492119 | 1 | |
| 42.20830838 | 1 | |
| 42.20934207 | 1 | |
| 42.20961107 | 1 | |
| 42.21071825 | 1 | |
| 42.21152676 | 1 | |
| 42.21186802 | 1 | |
| 42.21254372 | 1 | |
| 42.21338876 | 1 | |
| 42.21435846 | 1 |
| Value | Count | Frequency (%) |
| 42.49546445 | 1 | |
| 42.49184849 | 1 | |
| 42.48727483 | 1 | |
| 42.47695486 | 1 | |
| 42.47244163 | 1 | |
| 42.45625452 | 1 | |
| 42.45475039 | 1 | |
| 42.45389859 | 1 | |
| 42.45177113 | 1 | |
| 42.446131 | 1 |
LON
Real number (ℝ)
High correlation Unique
| Distinct | 974 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -71.027524 |
| Minimum | -71.165648 |
|---|---|
| Maximum | -70.730824 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 974 |
| Negative (%) | 100.0% |
| Memory size | 7.7 KiB |
Quantile statistics
| Minimum | -71.165648 |
|---|---|
| 5-th percentile | -71.127169 |
| Q1 | -71.068002 |
| median | -71.038397 |
| Q3 | -70.999202 |
| 95-th percentile | -70.897932 |
| Maximum | -70.730824 |
| Range | 0.4348238 |
| Interquartile range (IQR) | 0.068800045 |
Descriptive statistics
| Standard deviation | 0.06937506 |
|---|---|
| Coefficient of variation (CV) | -0.00097673489 |
| Kurtosis | 2.6350975 |
| Mean | -71.027524 |
| Median Absolute Deviation (MAD) | 0.03489828 |
| Skewness | 1.1874088 |
| Sum | -69180.808 |
| Variance | 0.004812899 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -71.05737164 | 1 | 0.1% |
| -70.97550306 | 1 | 0.1% |
| -71.0631616 | 1 | 0.1% |
| -70.96708508 | 1 | 0.1% |
| -71.0668012 | 1 | 0.1% |
| -71.05881297 | 1 | 0.1% |
| -71.11810672 | 1 | 0.1% |
| -71.15622361 | 1 | 0.1% |
| -71.01598304 | 1 | 0.1% |
| -71.1006892 | 1 | 0.1% |
| Other values (964) | 964 |
| Value | Count | Frequency (%) |
| -71.16564805 | 1 | |
| -71.16512791 | 1 | |
| -71.16380455 | 1 | |
| -71.16136896 | 1 | |
| -71.16058687 | 1 | |
| -71.15925936 | 1 | |
| -71.15877407 | 1 | |
| -71.15717571 | 1 | |
| -71.15622361 | 1 | |
| -71.15545553 | 1 |
| Value | Count | Frequency (%) |
| -70.73082425 | 1 | |
| -70.73471526 | 1 | |
| -70.74154837 | 1 | |
| -70.74164746 | 1 | |
| -70.74205298 | 1 | |
| -70.74838736 | 1 | |
| -70.75239647 | 1 | |
| -70.76346235 | 1 | |
| -70.77260577 | 1 | |
| -70.77347668 | 1 |
Interactions
Correlations
| CITY | COUNTY | ETHNICITY | GENDER | LAT | LON | MARITAL | PREFIX | RACE | SUFFIX | ZIP | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CITY | 1.000 | 0.988 | 0.203 | 0.068 | 0.519 | 0.530 | 0.000 | 0.026 | 0.080 | 0.000 | 0.812 |
| COUNTY | 0.988 | 1.000 | 0.238 | 0.065 | 0.476 | 0.459 | 0.000 | 0.043 | 0.103 | 0.099 | 0.694 |
| ETHNICITY | 0.203 | 0.238 | 1.000 | 0.031 | 0.056 | 0.093 | 0.054 | 0.081 | 0.102 | 0.000 | 0.154 |
| GENDER | 0.068 | 0.065 | 0.031 | 1.000 | 0.133 | 0.041 | 0.000 | 0.999 | 0.000 | 0.000 | 0.022 |
| LAT | 0.519 | 0.476 | 0.056 | 0.133 | 1.000 | -0.260 | 0.000 | 0.070 | 0.056 | 0.000 | -0.023 |
| LON | 0.530 | 0.459 | 0.093 | 0.041 | -0.260 | 1.000 | 0.066 | 0.087 | 0.033 | 0.000 | 0.027 |
| MARITAL | 0.000 | 0.000 | 0.054 | 0.000 | 0.000 | 0.066 | 1.000 | 0.703 | 0.032 | 0.000 | 0.000 |
| PREFIX | 0.026 | 0.043 | 0.081 | 0.999 | 0.070 | 0.087 | 0.703 | 1.000 | 0.000 | 0.000 | 0.000 |
| RACE | 0.080 | 0.103 | 0.102 | 0.000 | 0.056 | 0.033 | 0.032 | 0.000 | 1.000 | 0.000 | 0.066 |
| SUFFIX | 0.000 | 0.099 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
| ZIP | 0.812 | 0.694 | 0.154 | 0.022 | -0.023 | 0.027 | 0.000 | 0.000 | 0.066 | 0.000 | 1.000 |
Missing values
Sample
| Id | BIRTHDATE | DEATHDATE | PREFIX | FIRST | LAST | SUFFIX | MAIDEN | MARITAL | RACE | ETHNICITY | GENDER | BIRTHPLACE | ADDRESS | CITY | STATE | COUNTY | ZIP | LAT | LON | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5605b66b-e92d-c16c-1b83-b8bf7040d51f | 1977-03-19 | NaT | Mrs. | Nikita578 | Erdman779 | NaN | Leannon79 | M | white | nonhispanic | F | Wakefield Massachusetts US | 510 Little Station Unit 69 | Quincy | Massachusetts | Norfolk County | 2186.0 | 42.290937 | -70.975503 |
| 1 | 6e5ae27c-8038-7988-e2c0-25a103f01bfa | 1940-02-19 | NaT | Mr. | Zane918 | Hodkiewicz467 | NaN | NaN | M | white | nonhispanic | M | Brookline Massachusetts US | 747 Conn Throughway | Boston | Massachusetts | Suffolk County | 2135.0 | 42.308831 | -71.063162 |
| 2 | 8123d076-0886-9007-e956-d5864aa121a7 | 1958-06-04 | NaT | Mr. | Quinn173 | Marquardt819 | NaN | NaN | M | white | nonhispanic | M | Gardner Massachusetts US | 816 Okuneva Extension Apt 91 | Quincy | Massachusetts | Norfolk County | 2170.0 | 42.265177 | -70.967085 |
| 3 | 770518e4-6133-648e-60c9-071eb2f0e2ce | 1928-12-25 | 2017-09-29 | Mr. | Abel832 | Smitham825 | NaN | NaN | M | white | hispanic | M | Randolph Massachusetts US | 127 Cole Way Unit 95 | Boston | Massachusetts | Suffolk County | 2118.0 | 42.334304 | -71.066801 |
| 4 | f96addf5-81b9-0aab-7855-d208d3d352c5 | 1928-12-25 | 2014-02-23 | Mr. | Edwin773 | Labadie908 | NaN | NaN | M | white | hispanic | M | Stow Massachusetts US | 976 Ziemann Gateway | Boston | Massachusetts | Suffolk County | 2125.0 | 42.346771 | -71.058813 |
| 5 | 8e9650d1-788a-78f9-4a28-d08f7f95354a | 1928-12-25 | NaT | Mr. | Frankie174 | Oberbrunner298 | NaN | NaN | M | white | hispanic | M | Boston Massachusetts US | 303 Bechtelar Bypass Suite 26 | Boston | Massachusetts | Suffolk County | 2467.0 | 42.371026 | -71.118107 |
| 6 | 183df435-4190-060e-8f8e-bf63c572b266 | 1957-11-08 | NaT | Mrs. | Eilene124 | Walsh511 | NaN | Wiegand701 | M | asian | nonhispanic | F | Beijing Beijing Municipality CN | 235 Lang Parade | Cambridge | Massachusetts | Middlesex County | 2142.0 | 42.358928 | -71.156224 |
| 7 | 720560d4-51da-c38c-ee90-c15935278df1 | 1972-06-27 | NaT | Mr. | Lowell343 | Price929 | NaN | NaN | M | white | nonhispanic | M | Lowell Massachusetts US | 694 Kuhlman Corner Apt 74 | Quincy | Massachusetts | Norfolk County | 2170.0 | 42.297904 | -71.015983 |
| 8 | 217851b0-5f47-d376-18b9-0fe4ba77207e | 1954-03-06 | NaT | Mr. | Adrian111 | Gleason633 | NaN | NaN | S | black | hispanic | M | Boston Massachusetts US | 808 Gottlieb Wall | Boston | Massachusetts | Suffolk County | 2126.0 | 42.384084 | -71.100689 |
| 9 | ff331e5c-ab16-e218-f39a-63e11de1ed75 | 1927-07-10 | NaT | Mr. | Eugene421 | Abernathy524 | NaN | NaN | M | native | hispanic | M | Pembroke Massachusetts US | 706 Connelly Track Unit 1 | Boston | Massachusetts | Suffolk County | 2111.0 | 42.358519 | -71.078598 |
| Id | BIRTHDATE | DEATHDATE | PREFIX | FIRST | LAST | SUFFIX | MAIDEN | MARITAL | RACE | ETHNICITY | GENDER | BIRTHPLACE | ADDRESS | CITY | STATE | COUNTY | ZIP | LAT | LON | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 964 | c06513da-7f35-b4eb-5bab-28c87ff97a10 | 1959-03-04 | 2014-11-16 | Mr. | Tomas436 | Hermann103 | NaN | NaN | M | asian | nonhispanic | M | Worcester Massachusetts US | 754 Pfannerstill Park | Medford | Massachusetts | Middlesex County | 2155.0 | 42.416156 | -71.125391 |
| 965 | 6b6e8cda-703a-6a17-4943-b3bd1d709084 | 1944-03-08 | NaT | Mr. | Nigel915 | Carroll471 | NaN | NaN | S | white | nonhispanic | M | Milford Massachusetts US | 380 Wintheiser Run Apt 92 | Boston | Massachusetts | Suffolk County | 2110.0 | 42.391092 | -71.042205 |
| 966 | a2b011b1-ca0a-69fa-6906-6c06ad18376a | 1944-06-22 | NaT | Mr. | Gerald181 | Murray856 | NaN | NaN | M | white | nonhispanic | M | Chelmsford Massachusetts US | 952 Wyman Frontage road Suite 59 | Boston | Massachusetts | Suffolk County | 2136.0 | 42.336430 | -71.056928 |
| 967 | 7506d350-0f35-7c82-3b8a-d7aa80114352 | 1942-05-18 | 2018-03-29 | Mrs. | Maren639 | Breitenberg711 | NaN | Ritchie586 | M | white | nonhispanic | F | Rehoboth Massachusetts US | 460 Padberg Dale Apt 89 | Boston | Massachusetts | Suffolk County | 2131.0 | 42.325310 | -71.091714 |
| 968 | 5936f828-81d9-1a90-03b1-cfe49183dba8 | 1942-05-18 | NaT | Mrs. | Sunni15 | Olson653 | NaN | Nitzsche158 | M | white | nonhispanic | F | Boston Massachusetts US | 797 Shanahan Center | Boston | Massachusetts | Suffolk County | 2136.0 | 42.318959 | -71.051754 |
| 969 | d684571e-a784-ef61-429e-06fa0d2b1637 | 1924-03-15 | NaT | Mr. | Chris95 | Fisher429 | NaN | NaN | S | white | nonhispanic | M | Franklin Massachusetts US | 810 Yundt Forge Suite 2 | Medford | Massachusetts | Middlesex County | 2145.0 | 42.357626 | -71.040837 |
| 970 | 13c6f26e-17b7-f534-04db-78a26b26018d | 1940-10-31 | NaT | Mrs. | Berneice173 | Heaney114 | NaN | Hermiston71 | M | white | nonhispanic | F | Templeton Massachusetts US | 617 MacGyver Pathway | Boston | Massachusetts | Suffolk County | 2152.0 | 42.331490 | -71.039520 |
| 971 | 521e998b-ff0e-767f-b0ee-2bdf1168d66c | 1943-04-18 | NaT | Mr. | Jamal145 | VonRueden376 | NaN | NaN | M | white | nonhispanic | M | Chelmsford Massachusetts US | 505 Mertz Path Apt 40 | Boston | Massachusetts | Suffolk County | 2134.0 | 42.341971 | -71.040624 |
| 972 | b57e24a2-2e48-12f9-3293-c88745cfdc3f | 1941-04-28 | NaT | Mrs. | Chrissy459 | Rempel203 | NaN | Beer512 | M | asian | nonhispanic | F | Needham Massachusetts US | 366 Beer Crossroad | Cambridge | Massachusetts | Middlesex County | NaN | 42.337040 | -71.094676 |
| 973 | 204f8028-72f8-d6f8-761f-79ebf9f02311 | 1923-02-14 | NaT | Mrs. | Melaine933 | Hintz995 | NaN | Baumbach677 | M | white | nonhispanic | F | Southwick Massachusetts US | 382 Mosciski Road | Boston | Massachusetts | Suffolk County | 2128.0 | 42.357333 | -71.057372 |